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A load cluster management system using SNMP and web 
Myung-Sup Kim, Mi-Joung Choi, James W. Hong 

November 2002 International Journal of Network Management, volume 12 issue 6 
Publisher: John Wiley & Sons, Inc. 

Full text available: pdf(355.47 KB) Additional Information: full citation , abstract , references , index terms 

Clustered servers for Internet service is a popular solution to cope with the explosive 
increase in client requests. The high probability of service failure in cluster servers make 
the cluster management system necessary to provide high availability and convenient 
administrator control. In this paper, we present the design and implementation of a load 
cluster management system (LCMS) based on SNMP and Web technology. Our LCMS 
implementation has been deployed on a commercial ultra-dense server. 

Lessons learned from the novell and groupWise upgrade of the summer 2003 or 
"what we did on our summer vacation!" 
Robert L. Barley, Yancy Phillips 

October 2004 Proceedings of the 32nd annual ACM SIGUCCS conference on User 

services 
Publisher: ACM Press 

Full text available: ^ pdfd 74.72 KB) Additional Information: full citation , abstract , index terms 

This presentation will focus on the effects a recent major hardware and software upgrade 
had on the Indiana State University end user community and how collaboration between 
Technical and User Services resulted in a positive experience for the end user community. 
There were many hours of overtime put into this project and lots of concerns from User 
Services personnel about having to go out and "touch" each computer to facilitate 
necessary changes. The effects of the unsolicited visits produce ... 



Keywords: novell upgrades, reliability, standardization, support 



3 Operating and runtime systems for high-end computin g systems: MOLAR: adaptive Q 
runtime support for high-end computin g operating and runtime systems 
Christian Engelmann, Stephen L. Scott, David E. Bernholdt, Narasimha R. Gottumukkala, 
Chokchai Leangsuksun, Jyothish Varma, Chao Wang, Frank Mueller, Aniruddha G. Shet, P. 
Sadayappan 

April 2006 ACM SIGOPS Operating Systems Review, volume 40 issue 2 
Publisher: ACM Press 
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Full text available:^ pdf(522.07 KB) Additional Information: full citation , abstract , references , index terms 

MOLAR is a multi-institutional research effort that concentrates on adaptive, reliable, and 
efficient operating and runtime system (OS/R) solutions for ultra-scale high-end scientific 
computing on the next generation of supercomputers. This research addresses the 
challenges outlined in FAST-OS (forum to address scalable technology for runtime and 
operating systems) and H EC RTF (high-end computing revitalization task force) activities 
by exploring the use of advanced monitoring and adaptation to i ... 

Keywords: RAS, availability, fault tolerance, group membership, high-end computing, 
monitoring, reliability 



4 Can unstructured P2P protocols survive flash crowds? Q 
Dan Rubenstein, Sambit Sahu 

June 2005 IEEE/ACM Transactions on Networking (TON), Volume 13 issue 3 
Publisher: ACM Press 

Full text available:^) pdf(345.18 KB) Additional Information: full citation , abstract , references , index terms 

Today's Internet periodically suffers from hot spots, a.k.a., flash crowds. A hot spot is 
typically triggered by an unanticipated news event that triggers an unanticipated surge of 
users that request data objects from a particular site, temporarily overwhelming the site's 
delivery capabilities. During this time, the large majority of users that attempt to get 
these objects face the frustrating experience of not being able to retrieve the content they 
want while still being able to communicate ef ... 

Keywords: average case analysis, flash crowds, peer-to-peer 



Results 1 - 4 of 4 

The ACM Portal is published by the Association for Computing Machinery. Copyright © 2006 ACM, Inc. 
Terms of Usage Privacy Policy Code of Ethics Contact Us 

Useful downloads: Adobe Acrobat QuickTime IB Windows Media Player ^> Real Player 




http://portal.acm.org/results.cfm?CFID=1985966&CFTOKEN=60595468&adv=l&COL... 10/25/2006 



Results (page 1): "fail over" +lock 



Page 1 of 5 



6 PORTAL 



USPTO 



Subscribe (Full Service) Register (Limited Service, Free) Login 

Search: ® The ACM Digital Library O The Guide 
| "fail over" -Hock 



Search 



Feedback Report a problem Satisfaction 
surve y 



Terms used fail over lock 



Sort results | re | evance Sf Save results to a Binder 

by I Jsl 

, r ^ Search Tips 

expanded form [i nn ' . 

I — ^ D Open results in a new 

window 



Found 33 of 160 searched out of 160. 

Try an Advanced Search 

Try this search in The ACM Guide 



Display 
results 



Results 1 - 20 of 33 



Result page: 1 2 next 



Relevance scale □□Hi 



1 Manageability, availability, and performance in porcupine: a highly scalable, cluster- 
based mail service 

Yasushi Saito, Brian N. Bershad, Henry M. Levy 

August 2000 ACM Transactions on Computer Systems (TOCS), volume is issue 3 
Publisher: ACM Press 

Full text available: ^ pdf(2.52 MB) Additional Information: full citation , abstract , references , index terms 

This paper describes the motivation, design and performance of Porcupine, a scalable mail 
server. The goal of Porcupine is to provide a highly available and scalable electronic mail 
service using a large cluster of commodity PCs. We designed Porcupine to be easy to 
manage by emphasizing dynamic load balancing, automatic configuration, and graceful 
degradation in the presence of failures. Key to the system's manageability, availability, 
and performance is that sessions, data, and underlying ... 

Keywords: cluster, distributed systems, email, group membership protocol, load 
balancing, replication 



2 Some Deadlock Properties of Computer Systems 
Richard C. Holt 

September 1972 ACM Computing Surveys (CSUR), volume 4 issue 3 
Publisher: ACM Press 

Full text available: Q pdf(1.46 MB) Additional Information: full citation , references , citings , index terms 




3 Research papers: correctness and trust: Middleware based data replication providing pi 
snapshot isolation 

Yi Lin, Bettina Kemme, Marta Patino-Martmez, Ricardo Jimenez-Peris 
June 2005 Proceedings of the 2005 ACM SIGMOD international conference on 

Management of data 
Publisher: ACM Press 

Full text available: ^ pdf(470.92 KB) Additional Information: full citation , abstract , references 

Many cluster based replication solutions have been proposed providing scalability and 
fault-tolerance. Many of these solutions perform replica control in a middleware on top of 
the database replicas. In such a setting concurrency control is a challenge and is often 
performed on a table basis. Additionally, some systems put severe requirements on 
transaction programs (e.g., to declare all objects to be accessed in advance). This paper 
addresses these issues and presents a middleware-based replica ... 
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Frontmatter (TOC , Letters, Philos ophy of comp uter s cience, Interviewers need ed, Q 
Takin g software requirements creation from folklore to analysis, SW components and 
product lines: from business to systems and technology. Software engineering 
survey ) 

September 2005 ACM SIGSOFT Software Engineering Notes, volume 30 issue 5 
Publisher: ACM Press 

Full text available: pdf(1.98 MB) Additional Information: full citation , index terms 



Frangipani: a scalable distributed file system j 
Chandramohan A. Thekkath, Timothy Mann, Edward K. Lee 

October 1997 ACM SIGOPS Operating Systems Review , Proceedings of the sixteenth 
ACM symposium on Operating systems principles SOSP '97, volume 31 issue 
5 

Publisher: ACM Press 

Full text available: |||pdf(2.2Q MB) Additional Information: full citation, re ference s, citings, index terms 



6 Hig h Availability Cluster Checklist 
Tim Burke 

November 2000 Linux Journal 

Publisher: Specialized Systems Consultants, Inc. 

Full text available: g| html(17.23 KB ) Additional Information: full citation , abstract , index terms 

With a variety of clustering services on the market, the ability to determine how well 
options meet your specific business needs is necessary. 

7 Q focus: databases: Be yo nd relational databases 
Margo Seltzer 

April 2005 Queue, volume 3 issue 3 
Publisher: ACM Press 

Full text available:^ pdf(654.63 KB) AJJ . A . t(f 

hi] html(35.21 KB) Add,tl0nal Information: full citation , abstract , references , index terms 

There is more to data access than SQL. 

8 Queue Focus: Game Development: Massively Multiplaver Middleware 
Michi Henning 

February 2004 Queue, volume 1 issue 10 
Publisher: ACM Press 

Full text available:^ pdf( 2.34 MB) 111 ...... .. 

u? "V^m A « Additional Information: full citation , index terms 

html(28.00 KB) 



9 Potpourri: Fast and transparent recovery for continuous availability of cluster-based Q 
servers 

Rosalia Christodoulopoulou, Kaloian Manassiev, Angelos Bilas, Cristiana Amza 
March 2006 Proceedings of the eleventh ACM SIGPLAN symposium on Principles and 

practice of parallel programming PPoPP a 06 
Publisher: ACM Press 

Full text available: |S| pdfd 11.02 KB) Additional Information: full citation , abstract , references , index terms 

Recently there has been renewed interest in building reliable servers that support 
continuous application operation. Besides maintaining system state consistent after a 
failure, one of the main challenges in achieving continuous operation is to provide fast 
reconfiguration. The complexity of the failure reconfiguration mechanisms employed and 
their overheads depend on the type of platform that is being used as a server and the 
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types of applications that need to be supported. In this paper we foe ... 

Keywords: availability, distributed shared memory, fast failure reconfiguration, fault 
tolerance, scalability 

10 A new a p proach to deve lopi n g and implementing eager database replication Q 
protocols 

Bettina Kemme, Gustavo Alonso 

September 2000 ACM Transactions on Database Systems (TODS), volume 25 issue 3 
Publisher: ACM Press 

Full text available* fi3 Ddf(449 43 KB) Additional Information: full citation , abstract , references , citings , index 

: terms, review 

Database replication is traditionally seen as a way to increase the availability and 
performance of distributed databases. Although a large number of protocols providing 
data consistency and fault-tolerance have been proposed, few of these ideas have ever 
been used in commercial products due to their complexity and performance implications. 
Instead, current products allow inconsistencies and often resort to centralized approaches 
which eliminates some of the advantages of replication. As an ... 

Keywords: database replication, fault-tolerance, group communication, isolation levels, 
one-copy-serializability, replica control, total error multicast 



1 1 Affinit y -ba sed managem ent o f main memo r y d atab a s e clu s t er s 

#Minwen Ji 
November 2002 ACM Transactions on Internet Technology (TOIT), volume 2 issue 4 
Publisher: ACM Press 

Full text available: ^ p df( 553.96 KB ) Additional Information: full citation, abstract , references, index terms 

We study management strategies for main memory database clusters that are interposed 
between Internet applications and back-end databases as content caches. The task of 
management is to allocate data across individual cache databases and to route queries to 
the appropriate databases for execution. The goal is to maximize effective cache capacity 
and to minimize synchronization cost. We propose an affinity- based management system 
for main memory database cLUsters (ALBUM). ALBUM executes ea ... 

Keywords: Main memory database, clustering, database administration, database 
cluster, file organization, query affinity, scalability 
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12 AFS— a secure distributed filesvstem. Part III 
Alf Wachsmann 

April 2005 Linux Journal, volume 2005 issue 132 
Publisher: Specialized Systems Consultants, Inc. 

Full text available: g) html(22.05 KB) Additional Information: full citation , abstract , index terms 

Reconfigure servers without changing mount points on the clients with this Kerberos- 
authenticated network filesystem. 

13 Versatility and Unix semantics in namespace unification 
Charles P. Wright, Jay Dave, Puja Gupta, Harikesavan Krishnan, David P. Quigley, Erez 
Zadok, Mohammad Nayyer Zubair 

February 2006 ACM Transactions on Storage (TOS), volume 2 issue 1 
Publisher: ACM Press 

Full text available: ^g pdf(317.82 KB) Additional Information: full citation, abstract , references , index terms 

Administrators often prefer to keep related sets of files in different locations or media, as 
it is easier to maintain them separately. Users, however, prefer to see all files in one 
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location for convenience. One solution that accommodates both needs is virtual 
namespace unification— providing a merged view of several directories without physically 
merging them. For example, namespace unification can merge the contents of several 
CD-ROM images without unpacking them, merge binary directories fr ... 

Keywords: Namespace management, directory merging, snapshotting, stackable file 
systems, unification 



14 Research sessions: consistency and availabilit y: Hi ghly available, fault-tolerant . Q 
parallel dataflows 

Mehul A. Shah, Joseph M. Hellerstein, Eric Brewer 
June 2004 Proceedings of the 2004 ACM SIGMOD international conference on 
Management of data 

Publisher: ACM Press 

Full text available: ^ pdf(210.17 KB) Additional Information: full citation , abstract , references , citings 

We present a technique that masks failures in a cluster to provide high availability and 
fault-tolerance for long-running, parallelized dataflows. We can use these dataflows to 
implement a variety of continuous query (CQ) applications that require high-throughput, 
24x7 operation. Examples include network monitoring, phone call processing, click-stream 
processing, and online financial analysis. Our main contribution is a scheme that carefully 
integrates traditional query processing techniques for ... 

15 Poster session: papers included: System centric mission critical simulation model for Q 
MES automation 

Amit Jindal, Rajkumar Khandelwal 

December 2005 Proceedings of the 37th conference on Winter simulation WSC '05 
Publisher: Winter Simulation Conference 

Full text available: ^pdf(217.40 KB) Additional Information: full citation , abstract, references 

This paper describes the system centric simulation methodology used for stress testing of 
Manufacturing Execution System (MES) in Intel. System centric simulation involves 
testing such that the system components (infrastructure stack and software) are 
characterized for the load they would experience in production, irrespective of how that 
load is exerted. A new manufacturing execution system software is introduced in Intel's 
latest fabrication facility. Validation of the product under stress is ... 

16 Manageability , availability and performance in Porcupine: a hi g hly scalable , cluster- Q 
^ based mail service 

^ Yasushi Saito, Brian IM. Bershad, Henry M. Levy 

December 1999 ACM SIGOPS Operating Systems Review , Proceedings of the 

seventeenth ACM symposium on Operating systems principles SOSP 

'99, Volume 33 Issue 5 
Publisher: ACM Press 

Full text available: 1S] pdf(1.62 MB) Additional Information: full citation , abstract, references , citings, index 

terms 

This paper describes the motivation, design, and performance of Porcupine, a scalable 
mail server. The goal of Porcupine is to provide a highly available and scalable electronic 
mail service using a large cluster of commodity PCs. We designed Porcupine to be easy to 
manage by emphasizing dynamic load balancing, automatic configuration, and graceful 
degradation in the presence of failures. Key to the system's manageability, availability, 
and performance is that sessions, data, and underlying serv ... 

17 Database replication with Slon y-I Q 
Ludovic Marcotte 

June 2005 Linux Journal, volume 2005 issue 134 
Publisher: Specialized Systems Consultants, Inc. 

Full text available: g) html(22.00 KB) Additional Information: full citation , abstract , index terms 
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Move up to a- highly available cluster without leaving behind the open-source database 
you trust. 



18 Session 7: OS architecture II: Increasing relevance of memory hardware errors: a 
case for recoverable programming models 

Dejan Milojicic, Alan Messer, James Shau, Guangrui Fu, Alberto Munoz 
September 2000 Proceedings of the 9th workshop on ACM SIGOPS European 

workshop: beyond the PC: new challenges for the operating system 

Publisher: ACM Press 

Full text available: ^ pdf(99.65 KB) Additional Information: full citation , abstract , references , citings 

It is a common belief that most of computer system failures nowadays stem from 
programming errors. Computer systems are becoming more complex and harder to 
maintain and administer, making software errors an even more common case, while 
contemporary computer architectures are optimized for price and performance and not for 
availability. In this paper, we raise a case for an increasing relevance of memory 
hardware soft-errors. In particular with the introduction of 64-bit processors, memory 
scalin ... 




1 9 Experience reports: software architecture II: The co-evolution of a h y pe and a 
software architecture: experience of component-producin g large-scale EJB early 
adopters 
Lutz Prechelt 

May 2003 Proceedings of the 25th International Conference on Software 
Engineering 

Publisher: IEEE Computer Society 

Full text available: fg pdf(367.70 KB) 

Jsf Additional Information: full citation , abstract , references , index terms 

^ Publisher Site 

abaXX. components was one of the first API software products fully based on Enterprise 
JavaBeans™ (EJB) technology. We describe the evolution of its architecture as it moved 
from simply taking the initial EJB hype for the truth, through several intermediate stages, 
to using EJB simply as one of several encapsulated implementation techniques. So far, the 
public perception of how to use EJB properly evolved along a similar path, lagging 6 to 12 
months behind. 



20 Posters: Data versionin q techniques for internet transaction management Q 
^ Ramkrishna Chatterjee, Gopalan Arun 

>^ May 2005 Special interest tracks and posters of the 14th international conference on 
World Wide Web 

Publisher: ACM Press 

Full text available: ^gpdf (1 34.78 KB) Additional Information: full citation , abstract , references , index terms 

An Internet transaction is a transaction that involves communication over the Internet 
using standard Internet protocols such as HTTPS. Such transactions are widely used in 
Internet-based applications such as e-commerce. With the growth of the Internet, the 
volume and complexity of Internet transactions are rapidly increasing. We present data 
versioning techniques that can reduce the complexity of managing Internet transactions 
and improve their scalability and reliability. These techniques have ... 

Keywords: internet transaction, scalability, versioning 
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21 S ystems and p rototypes: Phoenix project: fault-tolerant a p plications Q 
Roger Barga, David Lomet 

June 2002 ACM SIGMOD Record, volume 31 issue 2 
Publisher: ACM Press 

Full text available: ^ pdf( 847.56 KB ) Additional Information: full citation , abstract , references , citings 

After a system crash, databases recover to the last committed transaction, but 
applications usually either crash or cannot continue. The Phoenix purpose is to enable 
application state to persist across system crashes, transparent to the application program. 
This simplifies application programming, reduces operational costs, masks failures from 
users, and increases application availability, which is critical in many scenarios, e.g., e- 
commerce. Within the Phoenix project, we have explored how to ... 



22 Industrial session: potpourri: Recovery principles of MvSQL Cluster 5.1 
Mikael Ronstrom, Jonas Oreland 

August 2005 Proceedings of the 31st international conference on Very large data 
bases VLDB a 05 

Publisher: VLDB Endowment 

Full text available: ^ pdf(299.62 KB) Additional Information: full citation , abstract , references , index terms 

MySQL Cluster is a parallel main memory database. It is using the normal MySQL 
software with a new storage engine NDB Cluster. MySQL Cluster 5.1 has been adapted to 
also handle fields on disk. In this work a number of recovery principles of MySQL Cluster 
had to be adapted to handle very large data sizes. The article presents an efficient 
algorithm for synchronizing a starting node with very large data sets. It provides reasons 
for the unorthodox choice of a no-steal algorithm in the buffer manag ... 

23 A novel Fuzzy Logic Controller (FLC) for shortening the TCP channel roundtrip time 
by eliminatin g user buffer overflow adaptively 

Wilfred W. K. Lin, Allan K. Y. Wong, Tharam S. Dillon 

January 2005 Proceedings of the Twenty-eighth Australasian conference on Computer 
Science - Volume 38 ACSC '05 

Publisher: Australian Computer Society, Inc. 

Full text available: ^pdf( 951.23 KB ) Additional Information: full citation , abstract , references , index terms 

The proposed Fuzzy Logic Controller (FLC) is a novel approach for dynamic buffer tuning 
at the user/server level. It eliminates buffer overflow by ensuring that the buffer length 
always cover the queue length adaptively. The FLC and the AQM (active queue 
management) mechanisms at the router/system level together form a unified solution to 
stifle TCP (Transmission Control Protocol) channel buffer overflow over the Internet. The 
FLC contributes to: a) prevent the AQM resources dished out at ... 
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Keywords: FLC, PIDC, active queue management, buffer overflow, dynamic buffer 
tuning, internet, traffic pattern 



24 Cryptography as an operating system service: A case study 

#Angelos D. Keromytis, Jason L. Wright, Theo De Raadt, Matthew Burnside 
February 2006 ACM Transactions on Computer Systems (TOCS), volume 24 issue l 
Publisher: ACM Press 

Full text available: ^g pdf(669.12 KB ) Additional Information: full citation , abstract , references , index terms 

Cryptographic transformations are a fundamental building block in many security 
applications and protocols. To improve performance, several vendors market hardware 
accelerator cards. However, until now no operating system provided a mechanism that 
allowed both uniform and efficient use of this new type of resource.We present the 
OpenBSD Cryptographic Framework (OCF), a service virtualization layer implemented 
inside the operating system kernel, that provides uniform access to accelerator functio 

Keywords: Encryption, authentication, cryptographic protocols, digital signatures, hash 
functions 



An open architecture for next-generation telecommunication services 
Gregory W. Bond, Eric Cheung, K. Hal Purdy, Pamela Zave, J. Christopher Ramming 
February 2004 ACM Transactions on Internet Technology (TOIT), volume 4 issue l 
Publisher: ACM Press 

Full text available: 15} pdf(237.24 KB) Additional Information: full citation, abstract , references , citings, index 

terms 

An open (in the sense of extensible and programmable) architecture for IP 
telecommunications must be based on a comprehensive strategy for managing feature 
interaction. We describe our experience with BoxOS, an IP telecommunication platform 
that implements the DFC technology for feature composition. We present solutions to 
problems, common to all efforts in IP telecommunications, of feature distribution, 
interoperability, and media management. We also explain how BoxOS addresses many 
deficiencie ... 

Keywords: Component architectures, Intelligent Network architecture, Session Initiation 
Protocol, electronic mail, feature interaction, instant messaging, multimedia systems, 
network addressing, network interoperation, network optimization, network protocols, 
service creation 



26 Mi g ration: Optimizing the mi g ration of virtual computers Q 
^ Constantine P. Sapuntzakis, Ramesh Chandra, Ben Pfaff, Jim Chow, Monica S. Lam, Mendel 

^ Rosenblum 

December 2002 ACM SIGOPS Operating Systems Review, volume 36 issue si 
Publisher: ACM Press 

Full text available: ^ gpdfd.68 MB) Additional Information: full citation , abstract , references , citings 

This paper shows how to quickly move the state of a running computer across a network, 
including the state in its disks, memory, CPU registers, and I/O devices. We call this state 
a capsule. Capsule state is hardware state, so it includes the entire operating system as 
well as applications and running processes.We have chosen to move x86 computer states 
because x86 computers are common, cheap, run the software we use, and have tools for 
migration. Unfortunately, x86 c ... 

27 An XML query engine for network-bound data Q 
Zachary G. Ives, A. Y. Halevy, D. S. Weld 

December 2002 The VLDB Journal — The International Journal on Very Large Data 

Bases, Volume 11 Issue 4 
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Publisher: Springer-Verlag New York, Inc. 

Full text available: ^g pdf(351.86 KB) Additional Information: full citation , abstract , citings , index terms 

XML has become the lingua franca for data exchange and integration across 
administrative and enterprise boundaries. Nearly all data providers are adding XML import 
or export capabilities, and standard XML Schemas and DTDs are being promoted for all 
types of data sharing. The ubiquity of XML has removed one of the major obstacles to 
integrating data from widely disparate sources - namely, the heterogeneity of data 
formats. However, general-purpose integration of data across the wide are a also re ... 

Keywords: Data integration, Data streams, Query processing, Web and databases, XML 



28 Articles: The Deliberate Revolution 
Mike Burner 

March 2003 Queue, volume l issue l 
Publisher: ACM Press 
Full text available: g] pdf (326. 81 KB) 
m html(64.11 KB) 



Additional Information: full citation , index terms 



29 Run-time adaptation in river 
^ Remzi H. Arpaci-Dusseau 

February 2003 ACM Transactions on Computer Systems (TOCS), volume 21 issue 1 

Publisher: ACM Press 

Full text available: pdf(849.04 KB) Additional Information: full citation , a bstract , references , index terms 

We present the design, implementation, and evaluation of run-time adaptation within the 
River dataflow programming environment. The goal of the River system is to provide 
adaptive mechanisms that allow database query-processing applications to cope with 
performance variations that are common in cluster platforms. We describe the system and 
its basic mechanisms, and carefully evaluate those mechanisms and their effectiveness. 
In our analysis, we answer four previously unanswered and important que ... 

Keywords: Performance availability, clusters, parallel I/O, performance faults, robust 
performance, run-time adaptation 



30 Industry track papers and presentations: technology trends: Building enterprise 
^ portals: principles to practice 

^ Tushar K. Hazra 

May 2002 Proceedings of the 24th International Conference on Software 

Engineering 
Publisher: ACM Press 

Full text available: ^ pdf( 1 .85 MB) Additional Information: full citation , abstract , references , index terms 

Primary objective of this paper is to offer an exclusive view of constructing and deploying 
enterprise portals by using a component-based development approach. As the dot-com 
hype dies down, most companies are forced to revisit their enterprise-wide Web 
integration strategies. This paper offers a pragmatic roadmap that these companies may 
follow in their upcoming enterprise portal deployment initiatives.The academic world plays 
a significant role in the advances of the portal technology. In this ... 

31 Business Continuity Plannin g 
Martin Nemzow 

July 1997 International Journal of Network Management volume 7 issue 3 
Publisher: John Wiley & Sons, Inc. 

Full text available: *jg pdf (295.36 KB) Additional Information: full citation , abstract , references , index terms 
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This article considers various strategies for protecting an organisation from both natural 
and man-made disasters. The differences between business continuity planning, and 
disaster recovery planning are recognised. © 1997 John Wiley & Sons, Ltd. 

32 Measurement tools: Robust synchronization of software clocks across the internet Q 
Darryl Veitch, Satish Babu, Attila Pasztor 

October 2004 Proceedings of the 4th ACM SIGCOMM conference on Internet 
measurement 

Publisher: ACM Press 

Full text available: ^pdf(1.59 MB) Additional Information: full citation , abstract , references , index terms 

Accurate, reliable timestamping which is also convenient and inexpensive is needed in 
many important areas including real-time network applications and network 
measurement. Recently the TSC register, which counts CPU cycles in popular PC 
architectures, was proposed as the basis of a new software clock which in terms of rate 
performance performs as well as more expensive GPS alternatives. Smooth and precise 
clock rate is essential to measure time <i>differences</i> accurately. We show ... 

Keywords: GPS, NTP, network measurement, round-trip time, software clock, 
synchronization, timing 

33 Implementin g a network improvement action plan Q 
Kim Todd, Jon Rickman, Tabatha Verbick 

October 2004 Proceedings of the 32nd annual ACM SIGUCCS conference on User 

services 
Publisher: ACM Press 

Full text available: |B|pdf(182.43 KB) Additional Information: full citation , abstract , index terms 

Firewall failures and the recent onslaught of computer viruses and worms, such as Klez, 
loveletter, SoBig, and BadBoy, increased network traffic and ever-growing network abuse, 
propelled security awareness at Northwest Missouri State University to a higher level and 
increased the need for finding and implementing effective solutions. 

Northwest's Information Systems Department was able to build awareness of computer 
security issues, decrease network abuse, institute significant enhance ... 
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terms, review 

This article presents our experience using optimistic synchronization to implement fine- 
grain atomic operations in the context of a parallelizing compiler for irregular, object- 
based computations. Our experience shows that the synchronization requirements of 
these programs differ significantly from those of traditional parallel computations, which 
use loop nests to access dense matrices using affine access functions. In addition to 
coarse-grain barrier synchronization, our irregular comput ... 
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We propose some architectural principles we have found useful for the support of 
continuous media applications in a microkernel environment. In particular, we discuss i) 
the principle of upcall-driven application structuring whereby communications events are 
system rather than application initiated, ii) the principle of split-level system structuring 
whereby key system functions are carried out co-operatively between kernel and user 
level components and iii) the principle of decoupling of contro ... 
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Ivy is a multi-user read/write peer-to-peer file system. Ivy has no centralized or dedicated 
components, and it provides useful integrity properties without requiring users to fully 
trust either the underlying peer-to-peer storage system or the other users of the file 
system.An Ivy file system consists solely of a set of logs, one log per participant. Ivy 
stores its logs in the DHash distributed hash table. Each participant finds data by 
consulting all logs, but performs modifications by appendi ... 
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Farsite is a secure, scalable file system that logically functions as a centralized file server 
but is physically distributed among a set of untrusted computers. Farsite provides file 
availability and reliability through randomized replicated storage; it ensures the secrecy of 
file contents with cryptographic techniques; it maintains the integrity of file and directory 
data with a Byzantine-fault-tolerant protocol; it is designed to be scalable by using a 
distributed hint mechanism and delegatio ... 
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Failure- resilient, scalable, and secure read-write access to shared information by mobile 
and static users over wireless and wired networks is a fundamental computing challenge. 
In this article, we describe how the Coda file system has evolved to meet this challenge 
through the development of mechanisms for server replication, disconnected operation, 
adaptive use of weak connectivity, isolation-only transactions, translucent caching, and 
opportunistic exploitation of hardware surrogates. For eac ... 
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Pangaea is a wide-area file system that supports data sharing among a community of 
widely distributed users. It is built on a symmetrically decentralized infrastructure that 
consists of commodity computers provided by the end users. Computers act 
autonomously to serve data to their local users. When possible, they exchange data with 
nearby peers to improve the systerrTs overall performance, availability, and network 
economy. This approach is realized by aggressively creating a replica of a file w ... 
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